A Comparison of Superposition Coding Schemes 



Lele Wang, Eren §a§oglu, Bernd Bandemer, and Young-Han Kim 
Department of Electrical and Computer Engineering 
University of California, San Diego 
La Jolla, CA 92093, USA 
Email: {lewOOl, esasoglu, bandemer, yhk}@ucsd.edu 



o 



CZ2 

o 



> 

00 

in 

(N 

o 

m 



x 



Abstract — There are two variants of superposition coding 
schemes. Cover's original superposition coding scheme has code 
clouds of the identical shape, while Bergmans 's superposition 
coding scheme has code clouds of independently generated 
shapes. These two schemes yield identical achievable rate regions 
in several scenarios, such as the capacity region for degraded 
broadcast channels. This paper shows that under the optimal 
maximum likelihood decoding, these two superposition coding 
schemes can result in different rate regions. In particular, it 
is shown that for the two-receiver broadcast channel, Cover's 
superposition coding scheme can achieve rates strictly larger than 
Bergmans's scheme. 



I. Introduction 

Superposition coding is one of the fundamental building 
blocks of coding schemes in network information theory. This 
idea was first introduced by Cover in 1970 at the IEEE 
International Symposium on Information Theory, Noordwijk, 
the Netherlands, in a talk titled "Simultaneous Communi- 
cation," and appeared in his 1972 paper [6]. Subsequently, 
Bergmans [2] adapted Cover's superposition coding scheme 
to the general degraded broadcast channel (this scheme is 
actually applicable to any nondegraded broadcast channel), 
which establishes the capacity region along with the converse 
proof by Gallager [11]. Since then, superposition coding 
has been applied in numerous problems, including multiple 
access channels [12], interference channels [3], [5], [13], relay 
channels [7], channels with feedback [8], [15], and wiretap 
channels [4], [9]. 

In a nutshell, the objective of superposition coding is to 
communicate two message simultaneously by encoding them 
into a single signal in two layers. A "better" receiver of the 
signal can then recover the messages on both layers while a 
"worse" receiver can recover the message on the coarse layer 
of the signal and ignore the one on the fine layer. 

On a closer look, there are two variants of the super- 
position coding idea in the literature, which differ in how 
the codebooks are generated. The first variant is described 
in Cover's original 1972 paper [6]. Both messages are first 
encoded independently via separate random codebooks of 
auxiliary sequences. To send a message pair, the auxiliary 
sequences associated with each message are then mapped 
through a symbol-by-symbol superposition function (such as 
addition) to generate the actual codeword. One can visualize 
the image of one of the codebooks centered around a fixed 
codeword from the other as a "cloud" (see the illustration in 




(a) Homogeneous coding 



(b) Heterogeneous coding 



Figure 1. Superposition codebooks for which (a) the structure 
within each cloud is identical and (b) the structure is nonidentical 
between clouds. Codewords (dots) are annotated by "mi, 7712", where 
m 1 is the coarse layer message and 7712 is the fine layer message. 



Figure 1(a)). Since all clouds are images of the same random 
codebook (around different cloud centers), we refer to this 
variant as homogeneous superposition coding. Note that in 
this variant, both messages enter on an equal footing and 
the corresponding auxiliary sequences play the same role. 
Thus, there is no natural distinction between "coarse" and 
"fine" layers and there are two ways to group the resulting 
superposition codebook into clouds. 

The second variant was introduced in Bergmans's 1973 
paper [2]. Here, the coarse message is encoded in a random 
codebook of auxiliary sequences. For each auxiliary sequence, 
a random satellite codebook is generated conditionally inde- 
pendently to represent the fine layer message. This naturally 
results in clouds of codewords given each such satellite 
codebook. Since all clouds are generated independently, we 
refer to this variant as heterogeneous superposition coding. 
This is illustrated in Figure 1(b). 

A natural question is whether these two variants are funda- 
mentally different, and if so, which of the two is preferable. 
Both variants achieve the capacity region of the degraded 
broadcast channel [2]. For the two-user-pair interference 
channel, the two variants again achieve the identical Han- 
Kobayashi inner bound (see [13] for homogeneous superposi- 
tion coding and [5] for heterogeneous superposition coding). 
Since heterogeneous superposition coding usually yields a 
simpler characterization of the achievable rate region with 
fewer auxiliary random variables, it is tempting to prefer this 



variant. 

In contrast, we show in this paper that homogeneous 
superposition coding always achieves a rate region at least as 
large as that of heterogeneous superposition coding for two- 
user broadcast channels, provided that the optimal maximum 
likelihood decoding rule is used. Furthermore, this dominance 
can be sometimes strict. Intuitively speaking, homogeneous 
superposition coding results in more structured interference 
from the undesired layer, the effect of which becomes tangible 
under optimal decoding. 

The rest of the paper is as follows. In Section II, we formally 
define the two variants of superposition coding schemes and 
present their respective rate regions. In Section III, we com- 
pare these rate regions. Additional remarks are provided in 
Section IV. 

Throughout the paper, we closely follow the notation 
in [10]. In particular, for X ~ p(x) and e £ (0,1), we 
define the set of e-typical n-sequences x n (or the typical set in 
short) [14] as % {n) {X) = {x n : \#{i : x t = x}/n-p(x)\ < 
ep(x) for all x £ X}. 

II. Rate Regions for the Two-Receiver BC 

Consider a two-receiver discrete memoryless broadcast 
channel depicted in Figure 2. The sender wishes to communi- 
cate message M\ to receiver 1 and message M 2 to receiver 2. 
We define a (2 nRl ,2 nR2 ,n) code by an encoder x n (m 1,7712) 
and two receivers 777,1(7/™) and rh 2 (y 2 ). We assume the mes- 
sage pair (Mi, M 2 ) is uniform over [1 : 2 nRl ] x [1 : 2" i?2 ] and 
independent of each other. The average probability of error 
is defined as pj ,l) = P{(Mi,M 2 ) ^ (M 1: M 2 )}. A rate pair 
(i?i, R 2 ) is said to be achievable if there exists a sequence of 
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Figure 2. Two-receiver broadcast channel. 

We now describe the two superposition coding techniques 
for this channel and compare their achievable rate regions 
under optimal decoding. 

A. Homogeneous Superposition Coding (UV Scheme) 

Codebook generation: Fix a pmf p(u)p(v) and a func- 
tion x(u,v). Randomly and independently generate 2 nRl 
sequences u n (mi), mi 6 [1 : 2 nRl ], each according to 
Y[- =1 Pu{ui), and 2 nR2 sequences v n (m 2 ), m 2 e [1 : 2 nR *], 
each according to Yli=i Pv( v i)- 

Encoding: To send the message pair (7711,777,2), transmit 
Xi {ui (mi), Vi{m 2 )) at time i £ [1 : 77]. 

Decoding: Both receivers use simultaneous nonunique de- 
coding, which is rate-optimal in the sense that it achieves the 
same rate region as maximum likelihood decoding [1] under 



the codebook ensemble at hand. In particular, upon receiving 
y n , receiver 1 declares 7771 is sent if it is the unique message 
such that 

for some 777,2. If there is no unique 777,1, it declares an error. 
Similarly, upon receiving y 2 , receiver 2 declares 7772 is sent if 
it is the unique message such that 

(u n {m^v n {m 2 )^) eT £ (n) 

for some mi. If there is no unique 7772, it declares an 
error. Standard typicality arguments show that receiver 1 will 
succeed if 

R 1 +R 2 <I(X-Y 1 ) 
Ri < I(U\Y{\ or x (1) 

or, equivalently, if 

Ri < 7(X;Yi|V) 
Ri + mm{R 2 ,I(X; Y x \ U)} < I(X; Y x ). 

Similarly, receiver 2 will succeed if 

i?i +i? 2 < I(X:Y 2 ) 
R2<I(V;Y 2 ) or (2) 
K 1 R 2 <I(X;Y 2 \U), 

or, equivalently, if 

R 2 < I(X; Y 2 I U) 
R 2 + min^!, I{X; Y 2 | V)} < I(X; Y 2 ). 

The regions for both receivers are depicted in Table 1 . Letting 
Muvip) denote the set of rates (Ri, R 2 ) satisfying (1) and (2), 
it follows that the rate region 



wv 



= co( |J Muvip)) 



is achievable. Here, co(-) denotes convex hull, and Vuv is the 
set of distributions of the form p = p(u) p(v) p(x\u, v) where 
p(x\u,v) represents a deterministic function. 

B. Heterogeneous Superposition Coding (UX Scheme) 

Codebook generation: Fix a pmf p(u,x). Randomly and 
independently generate 2 nRl sequences u n (mi), mi G [1 : 
2 nRl ], each according to Y\^ = iPu{ u i)- F° r each message 
To! e [1 : 2 nRl ], randomly and conditionally independently 
generate 2 nR2 sequences x n (m 1 ,m 2 ), m 2 G [1 : 2 nR2 ], each 
according to ]Xi=iPx\u{xi\ui{rni)). 

Encoding: To send (mi,m 2 ), transmit x n (mi,m 2 ). 

Decoding: Both receivers use simultaneous nonunique de- 
coding, which is rate-optimal as we show below. In particular, 
upon receiving y", receiver 1 declares 7771 is sent if it is the 
unique message such that 

( U "(m 1 ), a; ™(m 1 ,m 2 ),yr) &% {n) 



Receiver 1 



Receiver 2 



&uv(p) Ri <I(X-Y 1 \V) R 2 < I(X;Y 2 \U) 

p = p{u)p(v)p(x\u, v) R l + min{i? 2 , I(X; Y l \ U)} < I(X; Yl) R 2 + min^!, I(X; Y 2 \V)} < I(X; Y 2 ) 




Table 1. Rate regions for homogeneous and heterogeneous superposition coding. 



for some m 2 . If there is no unique mi, it declares an error. 
Similarly, upon receiving > receiver 2 declares rh 2 is sent if 
it is the unique message such that 

{u n {m l ),x n {m l ,rh 2 ) : y^) e T ^ 

for some mi. If there is no unique rh 2 , it declares an error. 
Standard arguments show that receiver 1 will succeed if 

R x <I{U;Yi) or R x + R 2 < I(X; Yl), (3) 

or, equivalently, if 

Ri + min{R 2 , 1(X; Y x \ U)} < I{X; Yl). 

Following an analogous argument to the one in [1], it can 
be shown that this region cannot be improved by applying 
maximum likelihood decoding. 
Receiver 2 will succeed if 

R 2 < I(X;Y 2 \U) 
Ri+R 2 <I(X;Y 2 ). 

In the Appendix, we show that this region cannot be improved 
by applying maximum likelihood decoding. The regions for 
both receivers are depicted in Table 1 . Let £%ux {p) denote the 
set of all (Ri,R 2 ) pairs satisfying both (3) and (4). Clearly, 
the rate region 




is achievable. Here, Vjjx is the set of distributions of the form 

p=p(u,x). 

If the roles of mi and m 2 in code generation are reversed, 
one can also achieve the region Myx = co(U p ^vx(p)) 
obtained by swapping Y\ with Y 2 and Rl with R 2 in the 
definition of £%ux(p)- 

It is worth reiterating that the two schemes above differ 
only in the dependence/independence between clouds around 
different u n sequences, and not in the underlying distributions 
from which the clouds are generated. Indeed, it is well known 
that the classes of distributions Vux an d Vjjv are equivalent 
in the sense that for every p(u,x) £ Vux, there exists a 
q(u) q(v) q(x\u, v) £ Vuv such that J2 V l(u) q(v) q{x\u, v) = 
p(u,x) (see for example [10, p. 626]). 

III. Main Result 

Theorem 1. The rate region achieved by homogeneous super- 
position coding includes the rate region achieved by hetero- 
geneous superposition coding, i.e., 

Moreover, there are channels for which the inclusion is strict. 

Proof: Due to the convexity of Mjjv an d the symme- 
try between UX and VX coding, it suffices to show that 
£%ux{p) C Muv for all p e Vux- Fix any p e Vux- Let 
q' G Vuv be such that U = X, V = 0, and the marginal 
on X is preserved q'(x) = p(x). Let q" € Vuv be such 




I(X;Y 2 \U) 
I(X;Yi\U) 




I(X;Y 2 \U) 
I{X;Y X \U) 



I(X;Yi\V) 

(a) Rate region in (6). (b) Rate region in (7). 

Figure 3. Rate regions for the proof of Theorem 1 . 



that V = X, U = 0, and the marginal on X is preserved 
q"(x) = p(x). An inspection of (l)-(4) and Table 1 reveals 
that Muv(q') is the set of rates satisfying 

R 2 = 

Ri </(X;li), 
and £%uv(q") is the set of rates satisfying 
i?i = 

R 2 <I(X;Y 2 ). 

It then follows that co(M uv (q') UM uv (q")) includes the rate 
region 

R x + R 2 <min{7(X;y 1 ),/(X;y 2 )}. (5) 

We will consider three cases and show the claim for each. 

• If I(X; Yi) > I(X; Y 2 ) then M V x(p) reduces to the rate 
region 

R 2 < I(X: Y 2 | U) 
R 1 + R 2 <I(X;Y 2 ), 

which is included in the rate region in (5), and therefore 
in 3$ uv . 

• If I{X;Yi) < I(X;Y 2 ) and I(X ; ^ | U) > I(X;Y 2 \U), 
then Mux (p) reduces to the rate region 

R 2 <I(X-Y 2 \U) 
R 1 +R 2 <I{X;Y 1 ), 

which is also included in the rate region in (5), and 
therefore in &uv- 

• JfI(X;Yi) < I(X;Y 2 ) and I(X; F x | U) < I(X;Y 2 \U), 
then &ux(p) reduces to the rate region (see Figure 3(a)) 

R 2 <I(X;Y 2 \U) 

i?! + min{i? 2 , 1(X; Y 1 \ U)} < I(X; Y^. 

Find a q 6 Vuv with q(u, x) = p(u,x), and note that 
&uv{q) is described by the bounds 

R 2 <I(X;Y 2 \U) 

Ri^IiX-Y^V) (7) 

Ri +min{R 2 ,I(X;Y 1 \ U)} < 7(X;Fi). 



Comparing (6) with (7) (Figure 3(b)), one sees that 
^ux(p) C co(M uv {q) \JM uv {q')). This proves the 
first claim of the theorem. 
Now consider the vector broadcast channel with binary inputs 
{Xi,X 2 ) and outputs (Yi,F 2 ) = (X 1 ,X 2 ). For all p £ P ux , 
we have from (4) that Ri + R 2 < I{X 1 X 2 ;Y 2 ) < 1, and 
similarly for all p G Vvx- Thus, (Mux UMvx) is included 
in the rate region R\+R 2 < 1. Note, however, that the rate pair 
(1,1) is achievable using the UV scheme by setting U = X\ 
and V = X 2 . This proves the second claim. ■ 

IV. Discussion 

In addition to the basic superposition coding schemes pre- 
sented in Section II, one can consider coded time sharing [10], 
which could potentially enlarge the achievable rate regions. 
In the present setting, however, it can be easily checked 
that coded time sharing does not enlarge Mux- Thus, the 
conclusion of Theorem 1 continues to hold and homogeneous 
superposition coding with coded time sharing outperforms 
heterogeneous superposition coding with coded time sharing. 

Appendix 
Optimality of the Rate Region in (4) 

We show that no decoding rule for receiver 2 can achieve 
a larger rate region than the one in (4) given the codebook 
ensemble of heterogeneous superposition coding. To this end, 
denote the random codebook by 

C = (U n (l), U n (2), X n (l, 1), X n (l, 2), . . . ). 

By the averaged version of Fano's inequality in [1], 

H(M 2 \Y 2 n ,C)<ne n , (8) 

where e„ — > as n —5- oo. Thus, 

nR 2 = H(M 2 ) 

<I(M 2 ;Y 2 n \C) + ne n 

= /(Af 2 ;r 2 n |C,Af 1 ) + n e „ 

= H(Y 2 n | C, Mr) - H(Y 2 n | C, M U M 2 ) + ne n 



(b) 

< nH{Y 2 | U) - H{Y 2 | X) + ne n 
= I(X;Y 2 \U) + ne n , 

where (a) follows by providing Mi to receiver 2 as side 
information from a genie and (b) follows from the codebook 
ensemble and the memoryless property. 

To see the second inequality, first assume that 

Rx<I(X;Y 2 ). (9) 

After receiver 2 has recovered m2, the codebook given this 
message reduces to 

C' = (X n (l,m 2 ),X n (2,m 2 ),X n (3,m 2 ),. . .). 

These codewords are pairwise independent since they do not 
share common U" sequences, and thus C is a nonlayered 
random codebook. Since (9) holds, receiver 2 can reliably 
recover Mi by using, for example, a typicality decoder. Thus, 
by (8), 

H(M 1 ,M 2 | Y 2 n ,C) = H(M 2 | Y 2 n ,C) + H(M 1 \ Y 2 n , C, M 2 ) 
< 2ne n . 

Hence 

n(R 1 + R 2 ) = H(Mi,M 2 ) 

<I(M 1 ,M 2 ;Y 2 n \C)+2ne n 
<nI{X;Y 2 ) + 2ne n . (10) 

To conclude the argument, assume there exists a decoding rule 
that achieves a rate point (Ri, R 2 ) with R\ > I(X: Y 2 ). Then, 
this decoding rule must also achieve (R[, R' 2 ) = (I(X; Y 2 ) — 
R 2 /2, R 2 ), a rate point that is dominated by (R±, R 2 ). Since 
R[ < I(X;Y 2 ), by our previous argument, (R[,R 2 ) must 
satisfy (10). It does not, which yields a contradiction. 
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